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(m) Flbronectins. 

The polypeptide sequence of the cbltagen-binding end 
fibrin-binding pans of fibronectin and the corresponding 
cDNA laquences have been determined. 

The coliaeen-binding polypeptide is useful in methods 
of purification and the fibrin-binding polypeptide is useful 
for targattlng therapeutic substances on natural fibrin. 
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Fibronectins (FNs) constitute a class of high 
molecular weight glyco-proteins that have a key role in 
various contact processes of the vertebrates such as cell 
a-t-tachment and spreading, cell migration, control of cell 

5 morphology, differentiation and oncogenic transformation. 
All these biological activities imply interaction of FN 
with cells and with extracellular materials. Binding 
activities for collagen, heparin, fibrin, cell surfaces, 
bacteria and DNA have been located in different domains of 

^D the FN molecule (for review, see Yamada , 1983). 

FN is one of the most versatile known proteins, both 
functionally and structurally. FN molecules are usually 
dimers of similar but not identical polypeptides of MK 
250,000. Cellular FN is found in a fibrilar component of 

15 the extracellular matrix of fibroblasts and other cell 
types. Plasma FN is a soluble molecule present in high 
concentrations in plasma OOOng/ml) and probably involved 
in opsonization, wound healing and haemostasis (Yamada, 
1983; Hynes & Yamada, 1982). Partial primary structure 

20 data have revealed high conserved amino acid sequences both 
between the two FN forms and among FNs from different 
species: bovine plasma (Petersen et al^, 1983), bovine 
cellular (Kornblihtt et al, 1983), human plasma (Pande I. 
Shively 1962; Garcia-Pardo et al, 1983), human cellular 

25 (Kornblihtt et al, 1983, 1984a; Oldberg et al, 1983), rat 
plasma ( Schwarzbauer et al, 1983). These data have tended 
to confirm that the basic FN polypeptide contains three 
different types of internal repeats (homology types 1, 
11 and 111, approximately 40, 60 and 90 amino acids long 

30 respectively, as originally shown in bovine plasma FN 
(Skorstengaard et a2 , 1982; Petersen et jbI, 1983). 
Variations on this basic fibronectin structure account lor 
the differences between cellular and plasma fibronectins 
and also .between the polypeptide chains of both forms. 
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The diverse forms of fibronectin seem to be generated by 
transcription of a single gene into a common precursor 
which undergoes alternative splicing { Vibe-Pedersen et al , 
1984). To-date, at least two regions have been described 

£> where this type of variation occurs. In certain human 
cell lines (fibroblasts, Hs578T) FN mRNAs can be 
distinguished by a 270 nucleotide segment (ED) that encodes 
exactly one of the homology type 111. This ED segment 
seems to be absent in the liver hepatocyte mRNAs which are 

10 the source of plasma FN (Kornblihtt et al.» 1983, 1984b), 
Schwarzbauer et al, (1983) have reported three different 
FN mRNAs arising by alternative splicing in rat liver which 
differ in an area (II3CS) located to the 3* carboxy 
terminus side in the protein of the ED region. The 

15 difference sequence does not belong to any of the known 
internal homologies and it is inserted between the last 
two type III homology repeats, near the COOH terminus. In 
addition, Umezawa et (1985) have reported further 
variations in the equivalent IlICS area of human liver 

20 FN mRNA, bringing the total to 5 alternative motifs for this 
area. The differences observed between FN polypeptides 
are thus the consequence of internal primary sequence 
variability (Kornblihtt etal, 1984a 1984b; Schwarzbauer 
et al« 1983), due to alternative splicing in at least two 

25 distinctive regions of the pre-mRNA (Tam)<.un et. a_3_ , 1984; 
Vibe-Pedersen et al, 1984; Umezawa et_ al,, 1 985 ). 

The complete amino acid sequence of mature human FN 
polypeptides has now been determined from the nucleotide 
sequence of multiple cDNA clones as described below. The 

30 polypeptide length varies from 2146 to 2325 amino acids, 
depending on which internal alternative splicing has taken 
place. 

The present invention thus ma}4es it possible to 
provide any desired part of the fibronectin molecule and 
35 in particular polypeptides having each of the separate 

binding activities of fibronectin separate from the others- 
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In the accompanying Figure 2, the binding sites of each i^cart 
of the FN air.ino-acid sequence are given. Wnilf- son-t- of ihcEG 
were previously known, the- sequence for the human collagen- 
binding site (lines 6-13 in the Figure) is new. Linei. 9 and 
5 10, involving the type J I homology, are believed to be of 
particular significance, and to incorporate most or all of 
the collagen-binding ability. 

The present invention thus provides novel polypeptides 
having substantially the amino acid sequence fron. 277 to 577 
10 shown in Figure 2, or any continuous part thereof having 

col lagc r.-bi ndi ng activity, and especially the sequence from 
iTn to 4 4 5. 3 n practice such pel yf.ej t i dc-s may be linked ic 
further an.ino-ac:d residues not affecting the desired end 
use of the collagen-binding polypeptide, including additional 
15 residues of the fibronectin molecule itseJf. 

Similarly other sequences of the fibronectin molecuje 
may be utilized for their ability to bind to other nioleculei, 
Thjs, as shown in Figure 2, the polypeptide sequence frcrh 
21 to 241 i£ associated with binding to fibrin, heparin and 
20 Staphylococcus aureus. Other sequences are, as shown, 
associated with binding to DNA, cells and alternative 
heparin and fibrin binding sites. 

The novel polypeptides may be made by culturing cells 
containing endogenous DNA coding for the polypeptide and 
2S separating the polypeptide from the metabolic products. 
Thus, the appropriate DNA sequences may be cloned into a 
competent strain of E .col i or other m icroorg ansm , e.g. a 
yeast such as S accharomyces cer ev i si ae , the latter 
cultivated, and then the desired polypeptide isolated from 
30 the cultivation products. Figure 3 shows the complete DK*. 
sequence for fibronectin and the associated amino-acid 
residues, and from this the cNDA sequence required for 
cloning the expression of any desired part of the 
fibronectin molecule may be easily determined. 
35 The DNA sequence coding for the collagen-binding 

polypeptide runs from coordinate 1147 to coordinate 1351, 
and the sequence coding for the fibrin-binding polypeptid'? 
runs from coordinate 73 to 738. 

More- F«rticularly, however, in accoraanco wit.n th.- la-eacsT. irKcr-ion, 
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the clones herein described as pFH54, pFH134, pFH16 and pFH6 
and similar clones may be used to produce corresponding 

polypeptides by expression in E. coli or otner appropriate 
iT.i croorgani snis . pFHl 34 and pFH16 contain the DNA sequence for 

5 the collagen-binding part of the fibronectin molecule and may 
be used to generate a polypeptide having the collagen-binding 
activity of fibronectin without its other binding affinities. 
pFH6 can be used to transform competent E. coli for the 
expression of a polypeptide binding fibrin and 

10 heparin. 

The manner in which the clones pFH54, pFH134, 
pFH16 and pFH6 wer£ obtained is described below, and 
repetition of the methods referred to will give similar 
clones with essentially the same or only slightly different 

15 utility. It will be appreciated in this connection that 
the isolation of useful cDNA sequences using tne total 
cellular RNA from an appropriate source, i.e. cells 
naturally capable of expressing the desired protein or 
polypeptide, is a matter of routine experimentation for 

20 the person skilled in the art using the currently 

available techniques, particularly when, as is the case 
here, the actual amino acid sequence for the desired protein 
or polypeptide and the corresponding DNA sequence are both 
known. Tne experimental section below mentions appropriate 

25 techniques which have been found to be effective but it as 
to be expected that other known technique-s would be 
equally applicable. Similarly choice of appropriate 
sources of RNA, vectors, and competent microorgardsms for 
transformation from the many materials which are now 

30 available to the experimentalist is well within the 
ordinary skill in this art. 

It will be appreciated that when a desired amino - 
acid sequence of fibronectin is expressed by a transformed 
microorganism it may be associated with a polypeptide 

35 characteristic of the mi croorgani sn. itself. This may be 
immaterial to the intended use of the polypeptide but, in 
some cases, eg. if the polypeptide is to be used iii tnerapi , 
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the presence of the additional amino-acid residues may be 
unacceptable. In that case the polypeptide must be 
subjected to an additional treatment, eg. with a protease, 
to Beparate the desired polypeptide free from undesired 
additional amino-acid residues. 

As indicated above, the present invention is of 
especiol interest in that it provides a means for transf orir.ing 
an appropriate microorganisin to make it capable of 
expressing a polypeptide able to bind to collagen and/or 
fibrin. A polypeptide able to bind to collagen can be 
used, for example, to facilitate affinity purification of 
valuable polypeptides. Thus if the collagen-binding 
polypeptide is expressed in a form in which it is bound tc 
another polypeptide of interest or if it .is linked, after 
isolation, to such a polypeptide, the combined polypeptides 
may be purified by affinity chromatography on a colun.n cf 
bound gelatin (i.e. collagen), and then, after the 
purification, the desired polypeptide may be separated, 
eg. by an enzymatic hydrolysis, from the col lagen-bind2r,c 
polypeptide. 

A polypeptide able to bind to fibrin may be used in 
therapy to target a therapeutic agent on natural fibrir, , 
eg. a blood clot. For example, a fibrinolytic enzyme 
bound to the polypeptide would have improved clot- 
dissolving properties, since it would have improved 
adhesion to its target. 

In the accompanying drawings. Fig. ^ shows a 
restriction enzyme map of seven cDNA clones covering 76&; 
nucleotides from the poly(A) tail of human FK mRNA. 
Human FN mRNA has been estimated to be 7900 nucleotides 
long (Kornblihtt et al, 19B3). The clones cover the 
complete coding region for the mature protein (bottom 
diagram showing binding sites) and the 3* non-coding recicr.. 
The dotted lines indicate segments that are absent from 
the corresponding cDKA clones but that must have been 
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synthesized In the first strand cDNA reaction and that were 
lost as a result of the failure of the Klenow enzyme to 
complete the second cDNA strand. Numbering on the map is 
in base pairs. 

5 Fig. 2 shows the complete amino acid sequence of a 

human FN polypeptide. ResiducB 1 and 2325 are the NH^ 
and COOH termini of the mature protein. The sequence 
was deduced from the nucleotide sequence (see Figure 3) 
of the cDNA clones depicted in Fig. 1. Alignment shows 

10 internal homologies. Gaps were introduced to maximize 

the homology. Identical residues within a type of homology 
are boxed. The cell recognition tetrapeptide RGDS 
(Pierschbacher t Ruoslahti, 1984) is underlined. Positions 
17 (5er}« 21 (Cys) and 42 (Val) are reported as Cys , Ser 

15 and Ala respectively by Garcia-Pardo et al (198.3). The FN 
polypeptide shown in this Figure has 2325 residues with 
a MW » 255,905. If the mass contributed by the 
carbohydrate side chains, estimated to be 9% of the protein 
mass (Yamada, 1983), is added, the molecular weight of this 

20 FN polypeptide would increase to approximately 279,000. This 
figure appears to be considerably higher than the weights 
of the FN monomers estimated by SDS-PAGE (230-250,000). The 
discrepancy could be explained by tne poor resolution of 
the SDS gels in the range of high molecular weight proteins 

25 together with the lack of appropriate protein standards in 
that range . 

Symbols are as follows:*, free SH groups; A, sites 
for carbohydrate side chains ; A , cleavage site for 
chymotrypsin. J , cleavage site for plasmin. The multiple 
3^> f ibronectin polypeptides can be generated by all the 

possible permutations of the alternative splice regions 
in lines 26 and 30 (as explained below). 

Fig. 3 shows the complete nucleotide sequence for 
the human FN polypeptide of Figure 2 deduced from the 
35 sequences of the cDNA clones of Figure 1. 

Fig . 4 shows the variations of the FN primary 
structure. 

- -s he cc.ple*.'^ structure of the mature protein. Black 

>io are type 1 homologies; dashed boxes, type II homologies; 
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empty boxes, type HI homologies. 

B and C show schematic representation of the different FN 
b polypeptides that could arise by the translation of the 
multiple FN RNAs observed in the ED region (B) and in the 
IIICS region (C). The name of the cDNA clones representing 
the corresponding encoding mRNA species is shown to the right 
of each polypeptide . indicates contiguity. Xrlf2, 4 and 6 

10 were isolated from a rat liver cDNA library ( Schwar zbauer 
et al, 1983), pFHLl and 6 from a human liver cDNA library, 
and pFHl was isolated from the HsSTBT cel3 line cDNA library 
(Kornblihtt et al , 1983). It will be noted that alJ tt,c- 
variations include the zones binding to collagen and fibrin. 

15 Figure 5 shows, in more detail than Figure 1, part of tf.c 

fibronectin protein indicating the position of the caliager.- 
binding domain and internal homologies (I, li, and 111). 
Below, the position and sizes (in base pairs) of the series of 
cDNAs used an the bacterial expression experiments described 

20 below is indicated. Only the restriction enzyme sites 

relevant to the cDNAs pXFKl-8 are shown. The flanking Hind 
HI and Bam HI sites of pFH134 and pFH16 occur in the 
polyl inker of the vector. 

Figure 1 shows the restriction map of different cDNA 

25 clones covering the 3* non-coding region and the complete 
coding region for the mature protein of human FN mRNA. The 
isolation of clones pFHl , pFHlll and pFH154 has been 
previously described (Kornblihtt et aj^, 1963, 1984a) and 
nucleotide sequence and deduced amino acid sequence of the 

30 latter has been published before (Kornblihtt et al , 1984b). 
The clones pFH54, pFH134, pFHlG and pFH6 are new. Isolation 
of these four cDNA clones covering the 5* third of the map 
involved the synthesis of an oligonucleotide primer. The 
sequence (namely 5-GCTGAACCATTTGCTGAGC ) of the primer was 

35 complementary to the mRNA sequence of a region close to the 
5* end of clone pFH154. The oligonucleotide was used to prime 
reverse transcription of total RNA from Hs578T cells (Hackett 
et al, 1977) and a cDNA library was prepared as described 
below. The clones pFH54 , pFH134, pFH6 and pFH16 were selected 
for further analysis. The complete nucleotide sequence of 
these clones was determined and comprised 7692 bp, of which 
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6972 bp correspond to the coding region and 720 bp to the 
3' nontranslatable region and poly{A)tail. The sequence is 
included in the full DNA sequence for FN shown in Figure 3. 

The amino acid sequence of human fibronectin 
deduced from the nucleotide sequence. of the clones in 
Figure 1 is shown in Figure 2. The alignment in Figure i 
maximises internal homologies. The complete FN chain 
presents regions which have 3 different types of internal 
homologies (types I, II and 111 ) ( Petersen et a_l, 1 983) 
and regions which have no homologous counterparts within 
the molecule. The latter, are the NH^-terminal and COOH- 
terminal segments _and the internal connecting strands. Fron, 
NH^-to COOH- terminus, FN is formed by one 20-residue long 
NH^-terminal segment (Fig. 2, line 1), five units of type 1 
homology or fingers (lines 2 to 6), one connecting strand 
(line 7), one finger (line 8), two units of type 11 . 
homology (lines 9 and 10), three fingers (line 11, 12 and 13), 
one unit of type 111 homology (line 14), one connecting 
strand (line 15), fourteen units of type III homology 
(lines 16 to 29, including the ED polypeptide), one 
connecting strand (IllCS, line 30), one unit of type 111 
homology (line 31), one connecting strand (line 32), three 
fingers (lines 33, 34 and 35) and the COOH- terminal 
segment (line 36). 

The primary structure of FN reflects a level of ordei 
and complexity not seen before in any other protein. The 
symmetry in the array of the 16 units of type 111 
homology. is particularly interesting. Two type 111 units 
(Fig. 2, lines 14 and 31) are separated by connecting 
strands (lines 15 and 30) from a central block, having the 
remaining fourteen in a juxtaposed way. The degree of 
homology within the type III units is very high. Three 
residues are conserved in all the units , viz Trp (Fig. 2, box 
with residue 599 at the top). Leu (box with residue 640 
at the top) and Tyr (box with residue 646 at the top). 
The conserved residues are distributed in two pea)ts around 
the Trp and the Tyr, separated by a valley of non homology, 
is believed that the degree of order and conservation in 
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the type 131 sequences must reflect particular constraintF. 

Of the secondary structure of the central region of 

FN. This region is not stabilized by disulphide 

bridges since the only two Cys residues present 

in the type 111 sequences (positions 1201 and 2075 in 

5 Figure 2) haye been shown to exist in a reduced forir, 
(Vibe-Pedersen et al. 1982; Smith et al, 1982). 

Several binding activities have been assigned to 
different regions of the FN moJecuie (see Figs 1 and 2). 
However, only in the case of the ability to bind cells, has 

10 the actual binding site been identified so far. In fact. 
Pier schbacher and Ruoslahti (I9fe4) demonstrated that the 
tetrapeptide Arg-Gl y-Asp-Ser (RGDS) is responsible for the 
cell attachiner<t activity of FN. This tetrapeptide is 
present only once in the sequence of Fig. 2 at positions 

IS 1493 to 1496, within one of the type 111 units. Fig. 2 
also shows that the optimal alignment of the type 111 
sequences in this area is obtained only if the tetrapeptide 
is considered to be an extra element, allowing four gaps 
in the corresponding regions of the rest of the type 111 

20 units. It is probable that, as well as the cell binding 
site, other binding sites or biological activities within 
type HI sequences reside in non-conserved stretches. 
The tetrapeptide has also been found in other proteins 
(Pierschbacher & Ruoslahti, 1984) including theCCchain of 

25 fibrinogen which shows cell attachment activity. 

An important feature of FN gene expression is the 
generation of slightly different polypeptides by 
differential processing of the common mRNA precursor 
(Vibe-Pedersen et al. 1984 ). Figure 4A shows diagramir.aricaily 

30 the localization of the two regions of variability 

observed so far along the FN molecule ( Schwarzbauer et al , 
1983; Kornblihtt et al , 19B4a 19B4b). Figures 4B and 
4C show the types of polypeptide that can arise fron. the 
translation of the different mRNAs generated in the ED 

35 (Fig. 2 line 26) and IIICS (Fig. 2, line 30) regions 

respectively. This diagram combines observations made on 
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both human and rot fibronectin. At least 10 different 
FN polypeptides can be generated from a single gene if it 
be assumed that all the permutations between the ED and 
IIICS Begments are possible. This is consistent with the 

5 FN polypeptide heterogeneity observed in the bidimensiona 1 
gel electrophoresis analysis of cellular and plasma FNs 
found irj vivo . Homo- or hetero- dimeric FN molecules 
can then be formed from the FN polypeptide pool. The 
biological significance of this complex situation is not 

10 yet clear. However, it is to be noted that the ED and 

IIICS variable regions are intercalated between the cell- 
heparin and heparin-f ibrin binding sites. The distance 
between these biologically active sites of the molecule 
may be critical for the FN function. For example, plasma 

lb FN is 1 to 2 orders of magnitude less active than cellular, 
fibronectin in restoring morphology and alignment to a 
transformed fibroblast cell line (Yamada i, Kennedy, 1 979). 
Further the mRNAs carrying the ED segment are present in 
fibroblasts (one source of cellular FN) but not in liver 

20 cells (one source of plasma FN) (Kornblihtt et a_l, 19B4b). 
It is possible that the function of the ED is to increase 
the distance between the cell binding tetrapeptide and the 
heparin binding site, resulting in an enhanced binding 
activity of the cellular FN molecule. 
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EXPERIMENTAL 

RNA Preparation 

Human cell line Hs578T (Hackett et al, 1977) was 
cultured in Dulbecco's modified Eagle's inedium containing 
10% foeto] calf serum. Total RNA was extracted from 
confluent cell monolayers by the guanidine -HCl method 
5 (Chirgwin et a2 , 1979). Between 2 and 4 mg of total RKA 
were extracted froir. 4 x 10^ cells. 

Other sources of RNA could be used if preferred, 
eg. fibroblasts or liver cells. 

Isolation of fibronectin cDNA clones 

All the cDNA clones depicted in Figure 1 were 

10 obtained using Hs578T cell RKA as template. Isolation o: 
clone pFHI by oligonucleotide probing was described by 
Kornblihtt et (1983). Isolation of clones pFHIll and 
pFH154 by "mRNA walking" (oligonucleotide priming) was 
described by Kornblihtt et al (1984a). This latter 

15 procedure was used for the isolation of the new clones 

pFH54, pFHl34, pFHi6 and pFH6. An oligonucleotide primer 
complementary to the mRNA region close to the 5' end of 
pFH154 was synthesized by the method of Gait et a_l (1S)6C). 
The oligonucleotide was used to prime reverse transcriptior, 

'40 of total RNA from Hs578T cells (Hackett et a_l, 1977). 
Blunt ended ds cDNA was prepared and cloned into the 
plasmid pATl 53/Pvul 1/8 (Anson et al, 1984) in £. col i 
MCI 061 as previously described (Kornblihtt et al., 1983). 
Colonies were screened using as probe a restriction 

25 fragment from the 5* end of pFHlE»4 lacking the primer 
sequences, labelled by filling in at one end. In this 
way, clones pFH54 and pFHl34 were obtained. In a second 
step, clones pFH 1 6 and 6 were obtained by screening witii 
an end labelled probe for the 5' end of clone pFH134. 



0207751 



- 12 - 

hc sl riction fragments of the fibrojiectin cNDA wore- filed in 
wath the Klenow fragment of DNA polymerase I and blunt end 
ligated into Smal cut/phosphatased pEX 1, 2 or 3 vector. 
Transformations were carried out using the E. coli strain 

5 LKlll (Zabeau et al^ # 1982) harbouring the plasmid pcl857 
which specifies kanamycin resistance and carries the cl857 
allele (Remaut et aj.. 1983). Colonies were transferred to 
Whatman 541 filter paper (Gergc-n et a_l» 1985) and screened 
with either 3' end labelled (Maxam et al^ . 1977 ) or nick 

10 translated probes (Rigby et al. 1977). 
Sequence determination 

Inserts frorr. clones were excised f ron, the vtctcr UNA b>- 
digestion with appropriate restriction enzymE^s, separated in 
agarose gel electrophoresis, and recovered by el ect r oel ut ion 

15 (Girwitz et a_l» 1980). Most of the sequencing was ptrforp.id 
by the chemical degradation procedure of Maxam and Gilbert 
( 1980). Some regions were sequenced by the chain te rr.i ria*.ct 
method (Sanger et al, 1977 ). For that purpose, the rele\ar,-^ 
fragments were isolated, digested either with Al u l or Hac 1 1 I 

20 and ligated to a Sma l digested Ml 3rT.p9 vector (Messing & 
Vieira, 1982), previously treated with calf intestinal 
phosphatase to prevent its circular ization . Tne ligation 
mixtures were used to transform competent E. coli JMl 01 and 
recombinants were selected as clear plaques. by insertional 

25 inactivation of the B-galactosidase gene- (Messir.c e t a] , 

1981). Single stranded DNA was prepared by stanoai u pi\.K.--.Aij: 
(Winter & Fields, 1980) and the inserts were seqjencoa usin>^ 
a "universal" 1 7-nucl eotide long primer (Duckwortli ot^ , 1981). 
Preparation of bacterial extracts 

30 Bacteria carrying recombinant plasnr.ids were grown at 30'C 

for 2*5h and expression of the cr o/ C -ga 1 actos i da se- fusioj. 
protein induced by sl^iifting to 42^C for 2h. Bacteria v^rc 
pelleted at 1200 g and washed with 50 mM TrisHCl, pH 7.4, 
170 mM NaCl . Cells were resuspendt-d in the same buffer 

35 containing lysozyme (2.5 mg/n.l ) and sonicated lor 2 r.i;. or. 
ice. The lysatc was centriluyeu at 45,000 g for 30 ir.u. at 
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A^C. The pellet was resuspended in 7 M urea in 10 inM Tris 
HCl , pH 7.4, ImM EDTA and incubated at room temperature for 
30 min. The solubilised extract was dialysed extensively 
against SO mM Tris HCl pH 7.4 at 4°C> and then centra fuged 

5 at 45.000 g for 30 min at 4'*C. 
Gelat in-Sepharose chromatography 

Gelatin-Sepharose was either obtained from Sigma 
Chemicals (St. Louis, MO, USA) or prepared by linking gclatari 
(pig skin type I, Sigma Chemicals) to CNBr -activated 

10 Sepharose CL.4B (Pharmacia, Uppsala, Sweden). Chr oma too r apr.y 
of bacterial extracts on gelati n-Sepharose was carried out 
as described by Ruoslahti _al (1982). The efficacy of the 
gelatin-Sepharose matrix was verified using purified human 
plasma fibronectin (Sigma Chemicals). 

15 El ectrophoretic analysis 

SDS-polyacrylamide gel electrophoresis was carried out 
in 0.1% (w/v) SDS in Tris/glycine buffer on 7.5% (w/v) 
acrylamide slab gels (19).. Gels were stained with 0.1% 
Coomassie blue in methanol/water/acet ic acid (4:5:1 by vol.). 

20 Jmmunoblotting was performed as described by Towbin et al. 
(1979 ). Polypeptides, ele.ctrophoret ical ly transferred to 
nitrocellulose were probed with rabbit. anti( human plasma 
fibronectin) serum (1:500 in phosphate buffered saline, 10% 
newborn calf serum and 0.05% Tween 20). Bound irnnaiogloL-Jlir 

25 was visualised using alkaline phosphat.ase conjugated goat 
anti-(rabbit IgG) (1:1000; Sigma Chemicals). 
Protein assay 

Protein was estimated by the method of Bradford (1976) 
using bovine serum albumin as a standard. 
30 Construction and characterisation of fibronectin expression plasrrdds 

The human fibronectin cDNA clones, pFHl34 and pFHl6, 
encompass all or part of the collagen-binding domain of 
fibronectin identified by proteolytic cleavage of the protein 
(see Fig. 1 and Fig. 5). These cDNAs were therefore chosen 
35 as the starting point for investigating the expression of a 
functional collagen-binding site in E. coli . The pEX vectors 
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used for cloning enable exogenous gene sequences to be- 
inserted into a polylinKer in all three reading fran.es at 
the 3' end of a cro-LacZ hybrid gene under the contro] of 
the- 7\ promoter (Stanley et al, 1984). The 5' ends of 

S the 1.74 kb and 1.04 kb inserts of pFH134 and pFHl6 

respectively were sequenced (Maxam et al_, 1980) to establish 
thcr reading frames of the cDNAs and blunt end cloned into 
the- Sn.a l site of pEX.2 . The recombinant plasmids were 
introduced into an E. col i strain previously transformed 

10 with a plasrr.id encoding the temperature-sensitive- 

repressor, cl857. This allows for temperature- i nduci LI < 
expression of the cro/fl-galactosidase protein. To test for 
the prodjction of fibronectiii fusion protein by the- 
expression constructs, hybridisation positive cloijes were 

15 grown at 30'C for 2H h and then shifted to 42*C for a further 
2 h. Total bacterial lysates were analysed by SDi poly- 
acrylamide gel electrophoresis. Five of ten pXFHi 3 4 
constructs and one of sever. pXFH16 constructs showio tne 
production of high molecular weight pol ^'pept idt-sof si7c?- 

20 consistent with the lengths of the cDNA inserts ( Ibb ku 
and -^165 kD respectively. The correct orientation of th'i 
fibronectin sequences in pXFH134 and pXFH16 was confirn.vd by 
restriction enzyme analyses. 

The fusion proteins produced by pXFHl34 and pXFHl6 

25 accounted for approximately 20\ of the total bacterial 

protein consistent with the previous report for this vertoj 
system (Stanley et a^, 1984). Both fusion proteins showed 
some proteolytic degradation, particularly the pXFHl34 
polypeptide, which appeared to be partially cleaved to the 

30 size of the wild-type cro/B-galactosidasc (116 kD) . . Analysis 
of proteins synthesised over a time course of induction (0 to 
120 min) indicated that proteolysis occurred concon.itant ly 
with synthesis of the fusion prott-ins. 

The expression of fibronectin antigenic determinants 
35 in pXFH134 and pXFHl6 was investigated by inm.unobl ot t i ng 

using a rabbit polyclonal ant i - ( hun.fjn plasn.a f ibroricct m ) serur:.. 
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The antiserum reacted with the 185 JtD polypeptide 
synthesised by pXFH134 but not with the pXFH16 fusion 
protein or the cro/B-gal actosidase polypeptide, indicating 
that the epitope(s) recognised by the antirserum lie 

6 outside the type 3 1 homology units and adjacent type I 

repeats (Fig. 5). This observation is consistent with the 
poor antigenicity of the collagen-binding domain of human 
fibronectin previously reported (Ruoslahti et al_, 1979 ) and 
most probably reflects the very high level of amino acid 

10 conservation in this region (Fig. 2). 

Gelatin-Sepharose affinity chromatography 

Over-production of B-galactosidase fusions in E. col i 
results in the precipitation of the protein in the cells as 
insoluble inclusion bodies (Williams et al, 19B2, Cheng lb83, 

16 Stanley 1983). Thus, when bacteria expressing the pXrH134 
plasmid were lysed by sonication and centrifuged, the 
fibronectin fusion protein was found exclusively m the 
insoluble pellet. This fraction represented approximately 
50%, of the total protein of the bacterial lysate. Solubilis- 

20 ation of this material required treatment with 7 M urea and, 
following dialysis, 60% of the protein remained in solution. 
This fraction, which was highly-enriched in the fusion 
protein, was applied directly to a 5 ml gelatin-Sepharosc 
column equilibrated in 50 mM Tris HCl , pH 7.4.. The column 

25 was washed with 0.5 M NaCl in 50 mM Tris HCl, pH 7 . 4 until 
the E28O °^ flowthrough was <0.01. Tlie cro/fl-galactosidasr'- 

fibronectin hybrid protein was eluted from the. column as a 
single symmetrical peak with 4 M urea in the same buffer. 
Under these conditions fibronectin is specifically released 

30 from gelatin-Sepharose (Ruoslahti et al, 1982). In the 

control experiment using pEX2 only, no binding of the wild- 
type cro/fi-galactosidase protein was observed. 

A functional collagen-binding site has therefore been 
reconstituted in the pXFHl34 fusion protein. It must be 

35 noted, however, that the fusion protein specifically eluted 
from the column represented < 5% of the fusion protein 
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applied to the column. Thus, not surprisingly, considerable 
activity is lost due to the insol ubil i sa tion of the fusion 
proteins in the bacterial cells, and subsequent vigorous 
treatment required to resolubilise them. 

5 The fibronectin fusion protein produced by pXFHl e was also 

tested for gelatin-binding and showed similar activity to 
pXFH134. This indicated that the collagen-binding region 
occurred within the domain defined at the protein level and 
strongly implicated the two type 11 and adjacent type I 

10 homology units (see Fig. 5). To further localise the binding 
site, a series of overlapping expression constructs was rr.adt 
from pFHl6 (Fig. 5) and systematically assayed for gelatin- 
binding activity. The results are summarised in Table 1, and 
show the consistent involvement of the type II honi&logy 
units (pXFN 2, 3 and 6). 

Table 1 



pEX construct Binding to 

Gelatin-Sepharose 



PXFH134 
pXrH16 
pXFNl 
2 
3 
A 
5 
6 
7 
B 

pEX2 vector only 
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The binding activity of pXFH134 is almost entirely accounted 
for by a construct consisting of the two type 11 homology 
units (pXFN3). By comparing the gelatin-binding activity of 
pXFN3 and pXFN6 (both active) with pXFNS and pXFN8 (both 

5 inactive), it may be deduced that the amino acid sequence 
critical for binding lies in the C-terminal half of the 
fibronectin fragment in pXFN3, and more particularly from 
the Hinfl site (coordinate 1147 of Figure 3) to the Rsal 
Bite (coordinate 1351 of Figure 3). This 66 amino acid 

10 sequence represents almost the entire second type II 

homology unit of fibronectin plus a few amino acids of the 

adjacent type 1 homology unit (see Fig. 2). 
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CLAIMS 

1. A polypeptide sequence having specific affinity 
for collagen and/or. fibrin and containing amino-acid 
residues in substantially the same order as in the amino- 
acid residues in the collagen binding and/or fibrin binding 

5 part of fibronectin. 

2. A polypeptide according to claim 1 comprising at 
least the collagen-binding portion of the amino-acid sequence 
277 to 577 shown in Figure 2. 

3. A polypeptide according to claim 1 comprising at 
10 least the collagen-binding amino-acid sequence 37y to 445 

shown in Figure 2. 

4. A polypeptide according to claim 1 comprising at 
least the fibrin-binding portion of the amino-acid sequence 
21 to 241 shown in Figure 2. 

15 5. A polypeptide according to any of claims 1 to 4 

bound to a polypeptide sequence not present in fibronectin. 

6. Method of purifying a substance which comprises 
contacting a conjugate of that substance and the collagen- 
binding portion of fibronectin with immobilized collagen 

20 so that the said conjugate binds to the said collagen, and 
then eluting the said conjugate. 

7. Method according to claim 6 in which the said eljted 
conjugate is then split to remove the said collagen-binding 
portion, and the said substance is then isolated. 

25 8. A polypeptide' according to claim 1 or 4 comprising 

a fibrin-binding amino-acid sequence bound to a therapeutic 
agent. 

9. A cDNA sequence coding for the polypeptide sequence 
of fibronectin having specific affinity for collagen and, or 
30 fibrin. 

10- A cDNA sequence according to claim 9 having the 
structure shown at 1147 to 1351 in Figure 3b. 

11. A cDNA sequence according to claim 9 having the 
structure shown at 73 to 738 in Figure 3a. 
»5 12. A plasmid or other vector containing a cDNA sequence 
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as defined in any of claims 9 to 11. 

13. A microorganisin modified by inclusion of a vector 
as claimed in claim 12. 

14. Escher ichi a col i modified by inclusion of a 
vector as claimed in claim 12. 
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CLAIMS FOR DESIGl^ATED STATE AT 

1. Process for producing a polypeptide sequence 
having specific affinity for collagen and/or fibrin and 
containing aitiino-acid residues in substantially the sarre 
order as in the amino-acid residues in the collagen binding 
5 and/or fibrin binding part of fibronectin, which coir.prises 
colturing a cell containing an exogenous DNA sequence 
coding for said polypeptide eequence, and separating Seid 
polypeptide sequence froir. the metabolic products of said 
eel 1 . 

10 2. A process according to Claim 1 in which the saic 

polypeptide comprises at least the coll ag en- b i nd i ng portion 
of the arr.ino-acid sequence 277 to 577 shown in Figure 2. 

3. A process according to Claim 1 in which the said 
polypeptide sequence comprises at least the collagen- 

15 binding air.ino-acid sequence 379 to 445 shown in Figure 2. 

4. A process according to Clsiir. 1 in which the said 
polypeptide sequence comprises at least the f ibr i n-bi nd i nc 
portion of the aTr.ino-acid sequence 21 to 241 shown in 
Figure 2. 

20 5. A process according to Claim 3 in which the 

exogenous DKA has the structure shown at 1147 to 1351 in 
Figure 3b. 

6. A process according to Claim 4 in which th^ 
exogenous DNA has the structure shown at 73 to 738 in 

25 Figur e 3a . 

7. Method of purifying a substance which corprises 
contacting a conjugate of that substance and the collagen- 
binding portion of fibronectin with immobilized collageit so 
that the said conjugate binds to the said collagen, and 

30 then eluting the said conjugate. 

8. Method according to Claim 7 in which the said 
eluted conjugate is then split to remove the said collagen- 
binding portion, and the said substance is then isolated. 
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